Op-cbio120233 2711..2718

نویسندگان

  • Shu Mei Teo
  • Yudi Pawitan
  • Chee Seng Ku
  • Kee Seng Chia
  • Agus Salim
چکیده

Motivation: Analysing next-generation sequencing (NGS) data for copy number variations (CNVs) detection is a relatively new and challenging field, with no accepted standard protocols or quality control measures so far. There are by now several algorithms developed for each of the four broad methods for CNV detection using NGS, namely the depth of coverage (DOC), read-pair, split-read and assembly-based methods. However, because of the complexity of the genome and the short read lengths from NGS technology, there are still many challenges associated with the analysis of NGS data for CNVs, no matter which method or algorithm is used. Results: In this review, we describe and discuss areas of potential biases in CNV detection for each of the four methods. In particular, we focus on issues pertaining to (i) mappability, (ii) GC-content bias, (iii) quality control measures of reads and (iv) difficulty in identifying duplications. To gain insights to some of the issues discussed, we also download real data from the 1000 Genomes Project and analyse its DOC data. We show examples of how reads in repeated regions can affect CNV detection, demonstrate current GC-correction algorithms, investigate sensitivity of DOC algorithm before and after quality control of reads and discuss reasons for which duplications are harder to detect than deletions. Contact: [email protected] or [email protected] Supplementary information: Supplementary data are available at Bioinformatics online. Received on May 30, 2012; revised on August 1, 2012; accepted on

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Isolation and purification of an extracellular protease from a new strain of Bacillus subtilis, viz.NCIM 2711.

A new extracellular protease having a prospective application in the food industry was isolated from Bacillus sUbtilis NCIM 2711 by (NH4)2SO4 precipitation from the cell broth. It was purified using DEAE-Cellulose and CM-Sephadex C-50 ion-exchange chromatography. With casein as a substrate, the proteolytic activity of the purified protease was found to be optimal at pH 7.0 and temperature 55 de...

متن کامل

Effects of 12-sulfodehydroabietic acid monosodium salt (TA-2711), a new anti-ulcer agent, on gastric mucosal lesions induced by necrotizing agents and gastric mucosal defensive factors in rats.

Effects of TA-2711 on gastric mucosal lesions induced by various necrotizing agents and several defensive factors of gastric mucosa were investigated in rats. Oral administration of TA-2711 at 12.5 to 200 mg/kg prevented the formation of gastric mucosal lesions induced by 99.5% ethanol, 0.6 N HCl, 0.2 N NaOH and boiling water with ED50 values of 24, 58, 16 and 101 mg/kg, respectively. Oral TA-2...

متن کامل

Joint large deviation result for empirical measures of the coloured random geometric graphs

We prove joint large deviation principle for the empirical pair measure and empirical locality measure of the near intermediate coloured random geometric graph models on n points picked uniformly in a d-dimensional torus of a unit circumference. From this result we obtain large deviation principles for the number of edges per vertex, the degree distribution and the proportion of isolated vertic...

متن کامل

Police, Equity, and Child Health.

, number 3 , March 2016 :e 20152711 From Oakland and Ferguson, to Cleveland and Baltimore, cities across the country mourn young African-Americans whose tragic deaths, following contentious encounters with police, illustrate the violent exchange that can erupt between law enforcement and people of color. Because police are vital pillars of community safety, these events raise important question...

متن کامل

Target-Bidirectional Neural Models for Machine Transliteration

Our purely neural network-based system represents a paradigm shift away from the techniques based on phrase-based statistical machine translation we have used in the past. The approach exploits the agreement between a pair of target-bidirectional LSTMs, in order to generate balanced targets with both good suffixes and good prefixes. The evaluation results show that the method is able to match a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012